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ABSTRACT The Trojan horse Escherichia coli antibiotic microcin C (McC) consists of a heptapeptide attached to adenosine 
through a phosphoramidate Unkage. McC is synthesized by the MccB enzyme, which terminally adenylates the ribosomally syn- 
thesized heptapeptide precursor MccA. The peptide part is responsible for McC uptake; it is degraded inside the cell to release a 
toxic nonhydrolyzable aspartyl-adenylate. Bionformatic analysis reveals that diverse bacterial genomes encoding mccB homo- 
logues also contain adjacent short open reading frames that may encode MccA-like adenylation substrates. Using chemically 
synthesized predicted peptide substrates and recombinant cognate MccB protein homologs, adenylated products were obtained 
in vitro for predicted MccA peptide-MccB enzyme pairs from Helicobacter pylori. Streptococcus thermophilus, Lactococcus john- 
sonii, Bartonella washoensis. Yersinia pseudotuberculosis, and Synechococcus sp. Some adenylated products were shown to in- 
hibit the growth of E. coli by targeting aspartyl-tRNA synthetase, the target of McC. 

IMPORTANCE Our results prove that McC-like adenylated peptides are widespread and are encoded by both Gram-negative and 
Gram-positive bacteria and by cyanobacteria, opening ways for analyses of physiological functions of these compounds and for 
creation of microcin C-Uke antibiotics targeting various bacteria. 
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The antibiotic microcin C (McC) (Fig. lA) is produced by 
Escherichia coli strains harboring a plasmid-borne nice 
operon (1, 2). McC consists of a heptapeptide whose C termi- 
nus is covalently linked through a phosphoramidate bond to 
adenosine; the phosphoramidate linker is esterified with an 
aminopropyl moiety (3). McC is active against E. coli strains 
that lack the mee operon as well as other enteric bacteria (1,2). 
McC penetrates the inner membrane of £. coli cells through the 
YejABEF transporter (4) . In the cytoplasm of sensitive cells, the 
N-terminal formyl group of McC is removed by the peptide 
deformylase, followed by removal of N-terminal methionine 
by methionine aminopeptidase and degradation of the rest of 
the peptide by nonspecific aminopeptidases (5). The last re- 
maining amino acid of McC peptide is an aspartate. Thus, fol- 
lowing proteolytic processing, a modified nonhydrolyzable 
aspartyl-adenylate, referred to as processed McC, is released. 
Processed McC is a potent inhibitor of aspartyl-tRNA synthe- 
tase (AspRS) (6), an enzyme essential for protein synthesis. 

While intact McC inhibits the growth of sensitive E. coli cells at 
low micromolar concentrations, processed McC does not affect 
cell growth, even at millimolar concentrations (6). The peptide 
chain therefore promotes active uptake of toxic processed McC, 
which itself is unable to penetrate cells, via the YejABEF trans- 
porter. The biological function of YejABEF (other than McC 
transport) is unknown. 



The McC peptide is encoded by a seven-codon-long mccA gene 
(7). During biosynthesis of the antibiotic, the MccA peptide 
(amino acid sequence MRTGNAN) is C-terminaUy adenylated by 
the MccB synthetase. The reaction proceeds through two rounds 
of adenylation and results in a conversion of terminal asparagine 
residue of the MccA peptide to aspartate (8, 9). The product of 
MccA adenylation is additionally decorated by an aminopropyl 
moiety in a reaction that proceeds through a mechanism yet to be 
defined and requires the products of mecD and mecE genes (10). 
The resulting mature McC is exported from the producing cells by 
MccC, an MFS-type transporter (11-13). Self-immunity to pro- 
cessed McC that accumulates in the cytoplasm of the producing 
cell is determined by the C-terminal domain of MccE, a GNAT 
(GcnS-related acetyl transferase) family acetyltransferase (14), 
and MccF, an S66 family protease (15). The mccF gene is tran- 
scribed separately from the mccABCDE operon and is not strictly 
required for McC production and/or immunity of the producing 
cell (11-13). 

The product of MccB-catalyzed adenylation of MccA, a pep- 
tide adenylate with a molecular mass of 1,119 Da (or 1,091 Da for 
a compound without an N-terminal formyl group) is biologically 
active, albeit its activity is significantly less than that of mature 
McC (10). Thus, the mccD and mccE genes, necessary for the ad- 
ditional aminopropyl decoration, are not strictly required for bi- 
ological function. Therefore, a three-gene arrangement (mccABC) 
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FIG 1 E. coli McC and homologous operons encoded by other bacteria. (A) Schematic representation of E. coli McC synthesis from the MccA peptide. Enzymes 
andcofactors involved are listed above and below the arrows; R represents N-terminal residues of MccA (formyl-MRTGNA). Numbers (in daltons [Da]) indicate 
differences in molecular masses between the products and the substrates of individual reaction steps. (B) At the top, the nice operon of E. coli is shown. Arrows 
indicate individual genes ( tnccA, red; mccB, yellow; mccC, green; mccD, pink; mccE, purple; mccF, blue). Homologous operons from indicated sources are shown 
under the arrows. Open reading frames coding for predicted MccA-like peptides are not drawn to scale. Open reading frames indicated as gray arrows in the 
Y. pseudotuberculosis nice operon correspond to yqcl and yegG gene homologs. The part of the mccB homolog indicated in brown encodes a methylase domain. 
For the Synechococcus sp. CC9605 mcc operon, the three open reading frames indicated in cyan encode ABC-type transporters. 



may fulfill the minimal requirements for biosynthesis and export 
of an McC-like molecule. Earlier, we analyzed bacterial DNA se- 
quences for the presence of adjacent genes coding for proteins 
similar to E. coli MccB and MccC and found such pairs in Helico- 
bacter pylori plasmids pHPM8 and HPP12 and in the genomes of 
Streptococcus thermophilus LMD-9, Lactococcus johnsonii NCC 



533, and Bartonella washoensis Sb944nv (16). No other adjacent 
mcc-like genes were found in these organisms (Fig. IB). Manual 
analysis of regions upstream of mccB homologs from these organ- 
isms identified putative seven-codon-long open reading frames 
(ORFs) whose products contained a C-terminal asparagine (16) 
(Fig. IB and Table 1). The products of these open reading frames 



TABLE 1 Predicted MccA peptides encoded by mcc-like operons of various bacteria" 



Organism/source MccA peptide 



Escherichia coli pMccC7 plasmid MRTGNAN 

Bartonella washoensis Sb944nv MDHIGFN 

Helicobacter pylori HPP12 and pHPM8 plasmids MKLSYRN 

Lactobacillus johnsonii NCC 533 MHRIMKN 

Streptococcus thermophilus LMD-9 MKGTILN 

Yersinia pseudotuberculosis IP32953 MYQVGIILSIKCN 

Yersinia pseudotuberculosis PB 1 / -I- MHQSEIKLTKRLKIKRVDVNKVKEQQKKVLECGAATCGGGSN 

Synechococcus sp. CC9605 MTQPNDRQLSNEELSDVAAGLFRRTFFKPRTSRKTLLQPKRLDKVAKNQLWADMMN 

" For each organism Usted in the left column, the predicted MccB substrates are Usted in the right (single-amino-acid code). Terminal asparagine residues are indicated in bold 
typeface. 
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differ from E. coli MccA but might be substrates for cognate MccB 
adenylate transferases. 

In the genome of Synechococcus sp. CC9605, an mccB-like 
gene is clustered with an mccD-like gene and two genes coding for 
proteins homologous to E. coli mccE N- and C-terminal domains 
(Fig. IB). While no mccC-like genes are present, three genes cod- 
ing for ABC-like transporters are located upstream of the mccB 
homolog. Two open reading frames encoding identical 56-amino- 
acid-long peptides with terminal asparagines are located between 
the ABC-transporter gene cluster and the mcci?-like gene (Fig. IB 
and Table 1). These peptides have been proposed to be adenyla- 
tion substrates for a cognate mccB gene product ( 16). A very sim- 
ilar operon is also encoded by Synechococcus sp. CC9616 (16). 

In Yersinia pseudotuberculosis IP32953, an operon containing 
homologs of £. coli mccBCDE genes is present (Fig. IB). The order 
of the mccB and mccC genes is inverted compared to that observed 
in E. coli mcc and in "minimal" mccABC operons, and two genes 
homologous to those coding for a predicted methylase and func- 
tionally uncharacterized yqcl- and yc^G-encoded family proteins 
are inserted between the mccB and mccD homologs. In addition, 
the homolog of Yersinia pseudotuberculosis IP32953 MccB is also 
fused to a methylase domain. An open reading frame encoding a 
13-amino-acid-long peptide with terminal asparagine was identi- 
fied upstream of the mccChomolog (Fig. IB), and the correspond- 
ing peptide (Table 1) was proposed to be the substrate for cognate 
MccB (16). 

E. coli McC can be prepared in vitro by adenylation of synthetic 
MRTGNAN peptide by recombinant E. coli MccB (8). Here, we 
used this approach to validate predicted mccA-mccB pairs from 
bacteria other than E. coli. We show that adenylated peptides can 
be prepared for most of predicted putative MccA-MccB pairs and 
that many of the resulting adenylated peptides inhibit the growth 
of £. coli in a YejABEF-dependent manner. By analogy with E. coli, 
we propose that many of these peptide adenylates inhibit the 
growth of cognate bacteria that lack mcc-like operons. 

RESULTS 

Enzymatic synthesis of McC-Iike peptide adenylates. Walsh and 
colleagues previously reported the in vitro synthesis of an McC 
maturation intermediate, adenylated MccA peptide lacking the 
aminopropyl decoration and N-terminal formyl group, in reac- 
tion mixtures containing synthetic MRTGNAN peptide, recom- 
binant E. coli MccB protein, and ATP (8). Using their conditions 
and mass-spectrometric analysis of reaction products, we too ob- 
served partial conversion of a mass peak corresponding to reac- 
tion substrate MRTGNAN (m/z = 763.5) into adenylated product 
(m/z = 1,092.5) (Fig. 2A). Accumulation of the reaction interme- 
diate, a peptide containing a C-terminal succinimide (m/z = 745), 
was also observed (indicated by an asterisk in the mass spectrum 
presented in the bottom panel of Fig. 2A), in agreement with ear- 
lier observations (8). 

Next, recombinant versions of MccB homologues encoded by 
H. pylori plasmids pHPM8 and HPP12 and the genomes of S. ther- 
mophilus LMD-9, L. johnsonii NCC 533, B. washoensis Sb944nv, 
Y. pseudotuberculosis IP32953, and Synechococcus sp. CC9605 
were purified using E. coli heterologous expression systems. The 
full-sized Y. pseudotuberculosis IP32953 MccB homolog was very 
poorly expressed. We considered that the unusual terminal fusion 
with a methylase-like protein (Fig. IB) might have affected the 
folding of this protein. Accordingly, a recombinant version of 



Y. pseudotuberculosis MccB lacking the C-terminal methylase do- 
main was prepared. This protein was soluble and expressed at high 
level and was used in subsequent experiments. Predicted peptide 
substrates were prepared by solid-phase chemical synthesis. The 
only exception was the putative substrate peptide encoded by Syn- 
echococcus sp. CC9605, which was prepared as a fusion with the 
MBP protein. Following tobacco etch virus (TEV) protease treat- 
ment, a 58-amino-acid peptide that lacked the N-terminal methi- 
onine residue of the predicted synechococcal MccA homolog and 
contained three extra N-terminal amino acids (SerGlySer) was 
obtained. 

Recombinant MccB homologs were combined with predicted 
peptide substrates in the presence or in the absence of ATP under 
conditions optimized for E. coli MccB-catalyzed reactions. Equal 
amounts of synthetic peptides were used. The amount of recom- 
binant Synechococcus sp. CC9605 peptide was at least 10 times 
lower, since we had difficulties obtaining sufficient amounts of 
this peptide. The results of mass-spectrometric analysis of reac- 
tion products are presented in Fig. 2. 

The predicted seven-amino-acid MccA-like peptides encoded 
by B. washoensis Sb944nv (MDHIGFN), S. thermophilus LMD-9 
(MKGTILN), and L. johnsonii NCC 533 (MHRIMKN) were read- 
ily adenylated by cognate enzymes (Fig. 2B, D, and E). The pre- 
dicted seven-amino-acid MccA-like peptide MKLSYRN encoded 
by H. pylori plasmids pHPM8 and HPP 1 2 was adenylated by MccB 
encoded by the HPP12 plasmid (Fig. 2C). The pHPM8-encoded 
protein MccB was inactive (data not shown). The pHPM8- 
encoded MccB lacks 46 N-terminal amino acids present in the 
HPP12-encoded homolog. The corresponding N-terminal amino 
acids of £. coli MccB form a peptide clamp domain that is involved 
in MccA binding (9). Analysis of mccB genes from H. pylori plas- 
mids revealed that the pHPM8 -borne copy of the gene had under- 
gone an A-to-G substitution that destroyed the starting codon of 
the HPP 12 mccB ORE as well as a 4-nucleotide insertion in an area 
between the annotated start codons of the HPP 12 and pHPM8 
homologs. The absence of intact N-terminal domain in pHPM8- 
encoded MccB explains its inactivity. The corresponding gene is 
thus defective and should be considered a pseudogene, while the 
mcc operon borne by the HPM8 plasmid must be inactive for 
production of McC -like compound. 

The 58-amino-acid putative Synechococcus sp. CC9605 sub- 
strate peptide was fully adenylated by the cognate enzyme 
(Fig. 2G). 

The 13-amino-acid-long Y. pseudotuberculosis IP32953 MccA 
peptide predicted earlier (16) was not modified by the cognate 
MccB protein ( data not shown) . A more current annotation of the 
Y. pseudotuberculosis PB1/-I- genome (NC_010634.1) suggests a 
longer 42-codon peptide (YP_001872354.1) as a possible sub- 
strate of adenylation by MccB. The same polypeptide is also en- 
coded by Y. pseudotuberculosis IP32953. When the longer peptide 
was prepared by chemical synthesis and tested in the adenylation 
reaction, highly efficient conversion into adenylated product was 
observed (Fig. 2F). The result thus validates the 42-amino-acid- 
long peptide as a substrate of Y. pseudotuberculosis IP32953 MccB. 

In all cases where adenylation products were detected, corre- 
sponding succinimide-containing intermediates (-18 Da of input 
peptide) were also observed, indicating that the reaction mecha- 
nism of MccB-like enzymes is conserved. The only exception was 
Synechococcus sp. CC9605 substrate peptide, which was quanti- 
tatively converted to adenylated product with no reaction inter- 
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FIG 2 In vitro adenylation of predicted MccA peptides by cognate MccB homologs. Chemically synthesized peptides corresponding to 7-amino-acid-long 
MccA products of £. coXi (A), B. washoensis Sb944nv (B), H. pylori plasmid HPP12 (C), I. johnsonii NCC 533 (D), S. thermophilus LMD-9 (E), Y. pseudotuber- 
culosis PB1/+ (F), full-length Synechococcus sp. CC9605 peptide (G), and peptides corresponding to the last 25 (H) and 20 (I) amino acids of Synechococcus sp. 
CC9605 peptide (Table 1 ) were combined with the corresponding recombinant MccB homologs in the absence (top) and the presence (bottom) of ATP. Reaction 
products were analyzed by MALDI MS. Only relevant parts of the spectra are shown. Asterisks indicate peaks corresponding to adenylation reaction interme- 
diates containing a terminal succinimide (Fig. lA). 



mediate detected. Adenylation reactions were specific; i.e., pep- 
tides became adenylated when incubated with enzymes from the 
same species but remained unmodified when incubated with non- 
cognate enzymes (data not shown). With the exception of Syn- 
echococcus sp. CC9605 peptide, no full modification of input 
peptides was achieved under our conditions (Fig. 2). Nevertheless, 
whenever modification was observed, a significant fraction of in- 
put peptide was adenylated that was comparable to what was ob- 
served in the control reaction with E. coU MccA-MccB. 

Biological activity of peptide adenylates. The results of enzy- 
matic adenylation in vitro validated most of the earlier bioinfor- 
matics predictions. We envision that adenylated peptides with or 
without additional modifications, such as the propylamine deco- 
ration present in E. coli McC, are produced by bacteria containing 



validated mcc-like operons and inhibit the growth of bacteria that 
do not carry the mcc-like operon. While we lack the capacity to test 
the homologous biological activity of adenylation reaction prod- 
ucts, we tested their ability to inhibit the growth of E. coli. In vitro 
adenylation reaction mixtures containing equal amounts of input 
substrate peptide and cognate adenylation enzymes were incu- 
bated with or without ATP, and aliquots of reaction mixtures were 
deposited on freshly seeded lawns of E. coli B cells (Table 2). Only 
reaction mixtures containing enzyme-substrate pairs that resulted 
in adenylated peptide production were tested. As a control, we 
used an McC-resistant isogenic strain lacking the yejB gene (4) . As 
can be seen from Table 2, reaction mixtures containing adenylated 
peptides from E. coli, S. thermophilus LMD-9, L. johnsonii NCC 
533, and H. pylori inhibited the growth of wild-type E. coli cells. 
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TABLE 2 Antibiotic activity of enzymatically synthesized McC-like compounds" 



E. coli 



Source 



Peptide adenylate 



E. coli B E. coli B 
AyejB AsbmA 



E. coli MRTGNAD-AMP 
B. washoensis Sb944nv MDHIGFD-AMP 

H. pyloriFU MKLSYRD-AMP 

I. johnsonii NCC 533 MHRIMKD-AMP 
S. thermophilus LMD-9 MKGTILD-AMP 

Yersinia MHQSEIKLTKRLKIKRVDVNKVKEQQKKVLECGAATCGGGSD-AMP 
pseudotuberculosis 
PB1/+ 
Synectiococcus sp. 

CC9605* 
Synechococcus sp. 

CC9605* 
Synectiococcus sp. 

CC9605* 
E. coli mcb 



+ 

+ 
+ 



SGSTQPNDRQLSNEELSDVAAGLFRRTFFKPRTSRKTLLQPKRLDKVAKNQLWADMMD-AMP 
SRKTLLQPKRLDKVAKNQLWADMMD-AMP 
LQPKRLDKVAKNQLWADMMD-AMP 



Microcin B 



" For each peptide adenylate listed, the results of biological activity testing of the indicated E. coli strain lawns are shown. For activity testing, 10 jj,l of the adenylation reaction 
mixture was deposited on freshly seeded lawns and, after an overnight incubation, growth inhibition zones around the positions where adenylation reaction mixture aliquots were 
deposited were recorded. +, clear growth inhibition zones were observed; — , no inhibition zones were observed. For Synechococcus sp. CC9605, an adenylation reaction was 
conducted with a recombinant peptide containing three nonnatural N-terminal amino acids (indicated in bold typeface; see Materials and Methods). The Synechococcus sp. 
CC9605 lines {mariied with an asterisic) show results obtained with adenylated peptides corresponding to the last 25 and 20 amino acids of Synechococcus sp. CC9605 MccA. 



While the efficiencies of adenylation of the various peptides dif- 
fered (Fig. 2), there was no gross difference in the sizes of the 
growth inhibition zones (8 to 15 mm in diameter), suggesting that 
the ability of various adenylated peptides to inhibit E. coli is com- 
parable to that of McC. No inhibition zones were observed on 
lawns of mutant cells, suggesting that adenylated peptides from 
S. thermophilus LMD-9, L. johnsonii NCC 533, and H. pylori enter 
E. coli through the YejABEF transporter. 

Adenylated B. washoensis Sb944nv and Y. pseudotuberculosis 
IP32953 peptides produced no inhibition zones on either wild- 
type or mutant cell lawns, suggesting that these peptides either are 
not internalized or are not processed within the cell (see below). 
An adenylated MccA homolog from Synechococcus sp. CC9605 
also produced no growth inhibition zones. However, as men- 
tioned above, the amount of this peptide used in the adenylation 
reaction was much lower than the amount of synthetic peptides, 
which could have affected the result of biological activity testing. 
Synthetic peptides corresponding to 25, 20, 15, and 10 C-terminal 
amino acids of synechococcal MccA were prepared and tested in 
the adenylation reaction with Synechococcus sp. CC9605 MccB. 
Mass-spectrometric analysis indicated that the two longer pep- 
tides were efficiently adenylated (Fig. 2H and I), while the shorter 
peptides remained unmodified (data not shown). Reaction mix- 
tures containing an adenylated 25-amino-acid peptide inhibited 
the growth of both the wild-type and yejB mutant E. coli strains 
with equal efficiencies (Table 2). In contrast, an adenylated 20- 
amino-acid peptide was inactive (Table 2). Since data determined 
with heptapeptide adenylates suggest that the YejABEF trans- 
porter appears to be sequence nonspecific, the result hints that 
there may be an additional route of cell entry for longer peptide 
adenylates. Indeed, E. coli cells with a disrupted sbmA gene coding 
for an inner-membrane transporter were sensitive to McC but 
were resistant to the action of an adenylated 25-amino-acid pep- 
tide. As expected, these cells were also resistant to microcin B, a 
much longer posttranslationally modified microcin that is known 
to be internalized through SbmA (17). 



In vitro activity of adenylated peptides. To determine if vari- 
ous adenylated peptides function similarly to the E. coli McC, 
aliquots of adenylation reaction mixtures were combined with S30 
extracts of £. coli K-12 wild-type cells, incubated for 15 min, and 
tested for aminoacylation of tRNA^''?, a reaction catalyzed by the 
target of processed McC, AspRS. Elsewhere, we show that a 15- 
min incubation is sufficient for McC processing under our reac- 
tion conditions (5, 6). In Fig. 3, the ratios of AspRS activities in 
extracts to which cognate MccB-MccA pair reaction mixtures 
with or without ATP were added are shown. As can be seen, the 
addition of reaction mixtures containing E. coli MccA, MccB, and 
ATP inhibited AspRS activity at a level ca. 10 times higher than 
that seen with the control (no ATP). Comparable levels of inhibi- 
tion were observed in reaction mixtures containing all other ad- 




FIG 3 Adenylated peptides from various sources inliibit E. coli AspRS in vitro. 
AspRS-catalyzed aminoacylation of tRNA^^P in S30 extracts prepared from 
wild-type E. coli was performed in the presence of aliquots of the indicated 
adenylation reaction mixtures with or without ATP. Extracts were incubated 
for 15 min to allow processing. The ratios of acid-insoluble radioactivity in the 
corresponding reaction mixtures with and without ATP are presented (mean 
values and standard deviations of measurements taken from three indepen- 
dently performed experiments are shown). 
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FIG 4 Differential processing rates of E. colt and B. washoensis Sb944nv MccA 
peptides in E. coU extracts. Equal amounts of E. coli and B. washoensis Sb944nv 
MccA peptides were combined with wild-type E. coli S30 extracts. At the times 
indicated, aliquots of reaction mixtures were removed and analyzed by mass 
spectrometry. H, D, and M stand for histidine, aspartate, and methionine 
residues, respectively. 

enylated peptides with the exception of that from B. washoensis 
Sb944nv, which showed poor inhibition (55% AspRS activity 
compared to the no- ATP control), and that from Y. pseudotuber- 
culosis (90% AspRS activity compared to the no-ATP control). 

We hypothesized that the poor inhibition of AspRS by adeny- 
lated B. washoensis Sb944nv and Y. pseudotuberculosis peptides 
was due to slow processing of the peptide in E. coli S30 extracts. 
This hypothesis was tested for the B. washoensis Sb944nv peptide 
by monitoring the fate of unmodified E. coli and B. washoensis 
Sb944nv MccA peptides in E. coli S30 extracts. As can be seen in 
Fig. 4, the N-terminal methionine is efficiently removed from 
both peptides (Fig. 4, second row). A mass peak corresponding to 
the resulting E. coli hexapeptide RTGNAN {m/z = 632.5) com- 
pletely disappeared after a 15-minute incubation with the extract 
due to processing by aminopeptidases, as expected (5). In con- 
trast, the B. washoensis Sb944nv DHIGFN hexapeptide (m/z = 
702) remained essentially intact even after a 60-min incubation, 
indicating that processing (and, therefore, release of C-terminally 
located AspRS inhibitor in the case of adenylated peptide) was 
inefficient. 

Expanding the family of microcin maturation adenylate 
transferases. Encouraged by successful verification of several pre- 
dicted microcins, we decided to revisit the MccB-like family of 
enzymes in order to identify those that could be involved in pro- 
duction of McC-lrke compounds. MccB-like proteins belong to a 
ThiF/HesA/MoeB/El superfamUy of phosphotransferases, which 
has been analyzed in detail previously (18). In particular, it has 
been shown that this superfamily has a complex evolutionary his- 
tory and is so divergent that phylogenetic reconstructions based 
on sequence alignments alone are impossible. The relationships 
between the families must be therefore elucidated based on anal- 
ysis of conserved sequence features, domain organization, and 
gene context (18). In particular, it has been indicated that the 
MccB family is related to the PaaA (pantocin A biosynthesis pro- 
tein) family (18). Unlike MccB proteins, which adenylate terminal 
asparagines in peptides, the PaaA enzyme from Pantoea agglom- 



erans appears to modify an internal asparagine residue of the sub- 
strate peptide (19,20). 

We systematically identified homologs of both MccB and PaaA 
proteins in fully sequenced bacterial genomes (see Materials and 
Methods). The resulting tree of the MccB and PaaA homologs is 
displayed in Fig. 5. All MccB and PaaA homologs included in the 
tree contain a predicted N-terminal peptide clamp domain that in 
E. coli MccB is required for MccA recognition (9). Further, the 
genes coding for MccB and PaaA homologs included in the tree 
have neighboring genes that encode either a potential efflux pump 
or proteins that could be involved in additional microcin modifi- 
cations and/or self-immunity (Fig. 5). These features are not 
found in members from the outgroup, which includes sequences 
that are confidently separated in both FastTree and RAxML trees 
(the bootstrap values are 99 and 69, respectively) and that are 
likely involved in thiamine and molybdenum cofactor biosynthe- 
sis. We therefore predict that the proteins presented in Fig. 5 are 
involved in the synthesis of McC-like compounds. Indeed, while 
this was not implicit in our analysis, all operons encoding premi- 
crocin peptides that were predicted before and verified in our 
work are present in the tree. The corresponding peptides are 
shown in the blue font in Fig. 5 (the prototypical E. coli MccA, and 
the pantocin precursor peptide are shown in the red font). Fur- 
thermore, in front of genes coding for some family members, we 
were able to identify short open reading frames coding for pep- 
tides of various lengths and containing terminal or internal aspar- 
agine residues (highlighted in the brown font). We predict that 
some of these peptides may be adenylation substrates for cognate 
MccB and PaaA homologs. 

DISCUSSION 

In this work, we validated bioinformatically predicted microcin 
C-like compounds encoded in the genomes and/or plasmids of 
several diverse bacteria. All four predicted heptapeptides tested, 
from H. pylori, S. thermophilus, L. johnsonii, and B. washoensis, 
were adenylated by cognate enzymes, and adenylated products 
inhibited the growth of McC-sensitive E. coli through the Trojan 
horse mechanism that was earlier described for E. coli McC (the 
mechanism facilitated transport through YejABEF transporter, 
intracellular processing by aminopeptidases, and inhibition of 
AspRS by a nonhydrolyzable aspartyl adenylate). In the case of 
E. coli McC, the enzymatic product of the MccB-catalyzed reac- 
tion mixture contains a direct N-P link that renders it a nonhy- 
drolyzable analog of the normally hydrolyzable Asp-AMP. While 
none of the analogs obtained in our work have been directly 
shown to contain an N-P bond, the fact that robust inhibition of 
AspRS is observed upon processing strongly supports this as- 
sumption. 

The B. washoensis McC-like compound was the least active of 
the heptapeptide adenylates. Apparently, a hexapeptide with an 
N-terminal aspartate that is produced after the N-terminal methi- 
onine of B. washoensis MccA is removed is poorly processed in 
E. coli. This conjecture agrees with the fact that PepN, which is 
considered the major aminopeptidase in E. coli (21), disfavors 
acidic residues and instead strongly prefers basic amino acids (21, 
22). Indeed, amino acids in the second position of efficiently pro- 
cessed E. coli, H. pylori, S. thermophilus, and L. johnsonii MccA 
peptides are basic, and none of these peptides contain internal 
acidic residues that could hinder processing. 

By analogy with E. coli McC, we hypothesize that the physio- 
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FIG 5 Phylogenetic tree and gene neighbors of predicted members of iVIccB/PaaA family. The maximum-likelihood phylogenetic unrooted tree was built using 
the FastTree program (33) and multiple alignment of conserved blocks (136 informative positions) of the ThiF/HesA/MoeB/El domain. The same program was 
also used to compute the bootstrap values indicated for all branches. The tree is rooted using outgroups (shown as triangles) of sequences with functions 
unrelated to microcin production; the bootstrap value confidently separating the outgroup is highlighted. Each terminal node of the tree is labeled by the 
GenBank Identifier number and the full systematic name of an organism. The founding MccB and PaaA genes are highlighted by red. Proteins that are discussed 
in this work are shown by blue highlights. Domain fusions are indicated in parentheses. One gene encoded in the same predicted operon suggesting relevance to 
microcin production is shown in the column on the right side of the tree. The symbol " -I- " indicates that there are other genes in the same operon that are related 
to microcin biosynthesis or immunity. Predicted and verified microcin precursor peptides are shown as follows: red, McC and pantocin precursors, with arrows 
indicating asparagine residues that are adenylated; blue, peptides validated in this work; brown, peptides predicted upstream of the corresponding MccB/PaaA 
family gene. Abbreviations: for the predicted efflux pumps, ABC, ABC-transported family, MFS, major facilitator transporter family, and DMT, divalent metal 
transporter family; for the predicted immunity genes, zincin, zincin superfamily Zn-dependent proteases, S8, S8 family of serine proteases, MccF, S66 family 
serine protease, and GNAT, GNAT family acetyltransferase; and for the predicted microcin modification enzymes, ThiF-like, ThiF/HesA/MoeB/El protein, 
MTase, S-adenosyl-L-methionine-dependent methyltransferase, and McbC, microcin B-processing oxidoreductase. 



logical function of adenylated heptapeptides validated in our work 
is to target bacteria closely related to the producing strain. The 
corresponding operons have the minimal mccABC arrangement, 
suggesting that heptapeptide adenylates function without the ad- 
ditional modification characteristic of E. coli McC. In the case of 
E. coli McC, the aminopropyl decoration increases the bioactivity 
ca. 1 0-fold by promoting the interaction of processed McC with its 
target through a favorable electrostatic interaction between the 
amine of aminoproryl and negatively charged residues of AspRS 



(10). The corresponding residues are present in AspRS enzymes 
from H. pylori, S. thermophilus, L. johnsonii, and B. washoensis, 
suggesting that the presence of aminopropyl would have increased 
the potency of McC-like molecules produced by these organisms. 
The absence of mccD and -NTD aminopropyl synthesis genes, as 
well as autoimmunity genes mccE-CTD and mccF, suggests that 
the minimal mccABC arrangement is ancestral and that the more 
complex arrangement of E. coli is derived. Stand-alone homologs 
of mccE and mccF are encoded in many bacterial genomes and 
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could have been "adopted" by the ancestor of the E. coli mcc 
operon. 

Perhaps the most surprising result of our work is the validation 
of adenylation of the much-longer MccA peptides from Synechoc- 
occus and Y. pseudotuberculosis. The mcc operon from Synechococ- 
cus is unique since the mccA gene is tandemly duplicated. In addi- 
tion, the likely export pumps are encoded by transporters of a type 
different from those in other validated adenylated-peptide bio- 
synthetic operons (Fig. IB), which may have to do with the greater 
length of the peptide moiety. Structural analysis of E. coli MccB 
bound to its substrate shows that there is an opening in the en- 
zyme that should allow the N-terminally extended 7-amino-acid 
MccA to bind to the enzyme (9). Our results show that in the case 
of synechococcal enzyme, the substrate must be at least 20 amino 
acids in length to be adenylated, indicating that enzyme-substrate 
interactions removed further from the catalytic center than in that 
of E. coli MccB are required. The presence of the mccD homolog 
and two genes corresponding to separate domains of MccE sug- 
gests that the McC-like compound produced by Synechococcus 
maybe aminopropylated. A recent report suggests that Synechoc- 
occus sp. CC9605, when cocultured with other synechococci that 
lack the mcc operon, inhibits the growth of those organisms and 
that mcc genes are responsible for the inhibition (23). Mass- 
spectrometric analysis of Synechococcus sp. CC9605 cultured me- 
dium failed to reveal a mass peak corresponding to full-sized ad- 
enylated MccA (our unpublished observations). However, it is 
possible that the initial adenylated peptide produced in the reac- 
tion catalyzed by Synechococcus sp. CC9605 MccB is proteolyti- 
cally modified and that the active compound is actually much 
smaller. 

Phylogenetic analysis shows that Y. pseudotuberculosis MccB is 
most closely related to Synechococcus sp. CC9605 MccB. The 
Y. pseudotuberculosis mcc operon has several distinct features, in- 
cluding fusion of MccB with a predicted methylase domain and 
two additional genes (one of which is another predicted methylase 
gene) with an uncharacterized function(s). Since we were unable 
to show the in vitro function of full-sized Y. pseudotuberculosis 
MccB, it is formally possible that Y. pseudotuberculosis is inacti- 
vated by these unique insertions. An alternative and more inter- 
esting possibility is that extra methylase homologs participate in 
additional modifications of Y. pseudotuberculosis peptide adenyl- 
ate. The adenylated 42-amino-acid Y. pseudotuberculosis MccA 
protein is inactive against E. coli. It is also unable to inhibit ami- 
noacylation of tRNA^'^P in E. coli extracts. The greater length of 
Y. pseudotuberculosis MccA is unlikely to be responsible, since the 
even longer Synechococcus sp. CC9605 peptide adenylate is read- 
ily processed. Thus, as in the case of B. washoensis, Y. pseudotuber- 
culosis MccA sequence likely prevents initial processing. In this 
case, Y. pseudotuberculosis MccA processing is probably compli- 
cated by the formation of a disulfide bond between the two cys- 
teines present in the peptide (see Table 1). Cocultivation experi- 
ments similar to those performed with Synechococcus (23) will be 
necessary to determine if the mcc operon mediates antagonistic 
interactions within the Yersinia genus, and comparative analysis 
of low-molecular-mass compounds produced by Y. pseudotuber- 
culosis with and without the mcc operon should help to elucidate 
the structure of the active compound, which may differ from that 
of the adenylated peptide obtained here by the in vitro approach. 

E. coli McC penetrates sensitive cells through the YejABEF 
inner-membrane transporter, and it was proposed by us earlier 



that the presence of such transporter is a good predictor of McC 
sensitivity (4). For organisms with validated MccA-MccB pairs, 
clear homologs of YejABEF are detected only in H. pylori. The 
transport systems responsible for uptake of McC-like compounds 
in B. washoensis, S. thermophilus, and L. johnsonii remain to be 
identified, but it is clear that they must be different from YejABEF. 
Cyanobacteria are highly sensitive to McC (23) (O. Bantysh, un- 
published), despite the lack of clear YejABEF homologs, again 
emphasizing that transporters other than YejABEF can import 
peptide adenylates. Our analysis of shortened versions of Syn- 
echococcus sp. CC9605 peptide adenylate revealed, curiously, that 
when peptide length exceeds a certain critical value (20 amino 
acids in our case), adenylated peptides cease to be transported by 
the YejABEF transporter. On the other hand, when peptide length 
reaches a certain critical value (25 amino acids in our case), ad- 
enylated peptides can be transported by SbmA. When the peptide 
length becomes even longer (52 amino acids for full-length Syn- 
echococcus sp. CC9605 peptide adenylate), SbmA becomes un- 
able to perform the transport function. While at the present stage 
these observations cannot exclude the effects of sequence and 
charge, they nevertheless suggest that, by alterations of the length 
of the transport moiety of the McC-type Trojan horse antimicro- 
bials, one can prepare peptide adenylates that enter bacteria 
through different routes, which could help counter bacterial resis- 
tance to this class of compounds. 

The results of our bioinformatics analysis suggest that enzymes 
of the MccB/PaaA family that are capable of adenylation of aspar- 
agine residues in short ribosomally synthetized peptides are very 
abundant in completely sequenced genomes and are present in 
many bacterial phyla, including numerous human parasites and 
symbionts. Our list is by no means exclusive, since it is entirely 
possible that we missed some of the most divergent members of 
the MccB/PaaA family in this analysis. We predict that several 
peptides with asparagine residues that are encoded directly up- 
stream of some MccB/PaaA family genes are adenylation sub- 
strates. In vitro adenylation of synthetic peptides by cognate MccB 
enzymes provides a robust method for experimental validation of 
these predictions. Nevertheless, many MccB/PaaA family genes 
lacked identifiable neighboring premicrocin peptide genes. This 
suggests either that a different amino acid within an upstream 
peptide-encoding gene could be adenylated or that substrate pep- 
tides could be encoded elsewhere in respective genomes. The latter 
phenomenon was reported for thiazole-containing heterocyclic 
bacteriocins of bacilli (24). 

MATERIALS AND METHODS 

DNA, molecular cloning, and microbial techniques. H. pylori plasmids 
pHMGS and HPP12 were provided by Sarah A. Mclntire (Texas Woman's 
University) and Wolfgang Fischer (Max von Pettenkofer-Institut), re- 
spectively. L. johnsonii NCC 533 was provided by Stephane Duboux and 
Raymond David Pridmore (Nestle Research Center), Synechococcus sp. 
CC9605 was provided by Brian Palenik (Scripps Institution of Oceanog- 
raphy, University of California, San Diego [UCSD]), genomic DNA of 
Y. pseudotuberculosis IP32953 was provided by Elisabeth Carniel (Institut 
Pasteur), and genomic DNA of B. washoensis Sb944nv was provided by 
Michael Kosoy (Centers for Disease Control and Prevention, NIH). 

ORFs of £. coli mccB orthologs were amplified using appropriate prim- 
ers and cloned in pET28a( + ) expression vector between polylinker re- 
striction sites Ndel and BamHI (S. thermophilus LMD-9), Ncol and Hin- 
dlll (H. pylori pHMG8), Ncol and Xhol (H. pylori pHPP12 and 
L. johnsonii NCC 533), Ndel and Xhol (B. washoensis Sb944nv), Ndel and 
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BlpI (Synechococcus sp. CC9605), and Ncol and NotI (full-sized Y. pseu- 
dotuberculosis IP32953). E. coli mccB was cloned between Ncol and 
BamHI of pET32b vector. A fragment of Y. pseudotuberculosis mccB 
(codons 2 to 347) was expressed as a fusion with MalE. To obtain the 
fusion protein, a fragment of the £. coli malE gene coding for mature MBP 
was PGR amplified and inserted between the Ndel and BamHI restriction 
sites of the pET22a vector to generate pET22MBP. Next, the PCR- 
amplified DNA fragment of Y. pseudotuberculosis mccB was introduced 
between the BamHI and Xhol restriction sites of pET22MBP. 

Plasmid p28MBP was a generous gift of Satish Nair. Cells harboring 
this plasmid produce an MBP C-terminally fused to Synechococcus sp. 
CC9605 mccA. 

To disrupt the yejB and sbmA genes in a laboratory E. coli B strain 
hypersensitive to McC (6), we used a method described by Datsenko and 
Wanner (25) with pKD46, pKD3, pKD13, and pCP20 plasmids and the 
following primers: primer pair YejB_Del_Forl (5' ATGGGCGCTTACC 
TGATTGGCCGTCTGTTGCTGGTGATCGTGTAGGCTGGAGCTGCT 
TO 3') and YejB_DeLRevl (5' TTAACGTGCCTCAAAATCAATAGGC 
GGATCAACCAGCGTCATATGAATATCGTGCTTAG 3') and primer 
pair SbmA_Del_For (5' TGTTAAAACGATAAGAAGTTAGCAGGAGT 
GGATATGTTACACGTCTTGAGCGATTG 3') and SbmA_DeLRev 
( 5 ' GGTTAGTTGCTGAATTTTGTGACCATCCAGGTCATGATTCCG 
GGGATCCGTCGACCTG 3'). 

Peptides. Synthetic peptides were purchased from LLC Syneuro, Rus- 
sia, and were at least 98% pure. The 42-amino-acid-long MccA peptide of 
Y. pseudotuberculosis PB1/+ was purchased from GenScript USA Inc. and 
was more than 85% pure. Peptides were dissolved in deionized water to up 
to a 15 mM concentration and stored at — 20°C until further use. 

To prepare Synechococcus sp. CC9605 MccA peptide, the p28MBP 
plasmid was transformed in E. coli BL21(DE3) cells. Cells were grown at 
37''C in LB broth with 1% glucose and 50 p.g/ml kanamycin until the 
optical density at 600 nm (ODgog) reached 0.6. Cells were harvested, 
washed thrice with fresh LB, and resuspended in 200 ml of fresh LB broth 
supplemented with kanamycin and 0.3 mM IPTG (isopropyl-jS-D- 
thiogalactopyranoside) . After overnight growth at 18°C, cells were har- 
vested and resuspended in 20 ml of column buffer (20 mM Tris-HCl 
[pH 7.4], 200 mM NaCl, 1 mM EDTA, 1 mM sodium azide, 0.01 M 
j3-mercaptoethanol, and 0.1 mM phenylmethylsulfonyl fluoride 
[PMSF]). Cells were disrupted by sonication. After centrifugation 
(30,000 X g for 30 min at +4°C), the supernatant was combined with 
500 fxl of Amilose resin (NEB) equilibrated in the same buffer and pro- 
teins were allowed to bind for 40 min at 4°C with gentle agitation. The 
resin was settled by gravity and washed with column buffer, and bound 
protein was eluted in column buffer supplemented with 10 mM maltose. 
Eluted material was dialyzed against 50 mM Tris-HCl (pH 8.0)-0.1 mM 
EDTA at +4°C, supplemented with 5 mM dithiothreitol (DTT), and 
treated with AcTEV protease (Invitrogen) according to the manufactur- 
er's suggestions. The mixture was incubated at +4°C overnight and was 
then dialyzed against 75 mM Tris-HCl (pH 8.0)-50 mM MgClj at +4°C. 
The MccA peptide was separated from MBP by using an Ultracel-30K 
Amicon Ultra centrifugal filter (MiUipore). The filter flowthrough con- 
taining the separated peptide was concentrated on a rotary vacuum evap- 
orator. Dried peptide was dissolved in water and stored at — 20°C until 
further use. 

Protein expression and purification. All MccB proteins were ex- 
pressed in E. coli BL21(DE3) cells. To obtain soluble MccB proteins from 
H. pylori, L. jolinsonii NCC 533, Synechococcus sp. CC9605, Y. pseudotu- 
berculosis IP32953, and B. washoensis Sb944nv expressing cells were 
cotransformed with a compatible pG-KJE8 vector (from a Chaperone 
plasmid set; TaKaRa Bio inc.) expressing GroEL/GroES, DnaK, DnaJ, and 
GrpE chaperones. The chaperone-expressing plasmid was found to in- 
crease the yield of soluble MccB homologs. The BL21(DE3) cells were 
grown at 37°C on 200 ml LB medium supplemented with 1% glucose and 
necessary antibiotics until the ODggo reached 0.6. Cells were harvested by 
centrifugation, washed thrice with fresh LB medium, and resuspended in 



200 ml of fresh LB with antibiotics- 0. 1 mM IPTG. Where needed, 5 ng/ml 
tetracycline and 2 mg/ml arabinose were added to induce chaperone ex- 
pression. Cells were grown for 20 h at 18°C with vigorous agitation. Cells 
were harvested and resuspended in 8 ml of loading buffer (20 mM Tris- 
HCl [pH 8.0] , 500 mM NaCl, 10 mM MgClj) and disrupted by sonication. 
The lysates were cleared by centrifugation at 30,000 X g for 30 min at 
+4°C. To the supernatants, 300 /j,l of His Bind resin (Novagen) equili- 
brated in the same buffer was added, and proteins were allowed to bind for 
4 h at 4°C with gentle agitation. The resin was allowed to settle by gravity 
and washed with 15 ml of wash buffer (20 mM Tris-HCl [pH 8.0], 
500 mM NaCl, 10 mM MgClj, 50 mM imidazole), and bound proteins 
were eluted with 0.5 ml of elution buffer (20 mM Tris-HCl [pH 8.0], 
500 mM NaCl, 10 mM MgClj, 200 mM imidazole). Four consecutive 
elutions were performed with each resin sample. Fractions containing 
maximal adenylation activity (see below) were combined, supplemented 
with glycerol up to 50%, and stored at — 20°C until further use. The final 
concentration of MccB proteins ranged between 1 and 5 mg/ml. Proteins 
were at least 90% pure as judged by visual inspection of overloaded 
Coomassie-stained SDS gels. 

In vitro adenylation assays. Reaction mixtures contained (in 100 fjl) 
7.5 mM Tris-HCl (pH 8.0), 5 mM MgClj, 4 mM ATP (where appropri- 
ate), 2.5 mM TCEP [Tris(2-carboxyethyl) phosphine hydrochloride] , and 
a 225 /iM concentration of the appropriate MccA peptide (a concentra- 
tion of fuU-sized Synechococcus sp. CC9605 peptide that was 10 times 
lower was used). Reactions were initiated by the addition of an MccB 
protein (5 fxM final concentration), and reaction mixtures were incubated 
at 23°C for 20 h. Reactions were terminated by transferring them to 
-20°C. 

MALDI-MS and MS/MS analysis. Mass spectra were recorded on an 
Ultraflextreme matrix-assisted laser desorption ionization-tandem time 
of flight (MALDI-TOF-ToF) mass spectrometer (MS; Bruker Daltonik) 
equipped with an Nd laser (355 nm). The MH+ molecular ions were 
measured in reflector mode; the accuracy of monoisotopic mass peak 
measurement was 50 ppm. Aliquots (1 /xl) of desalted adenylation reac- 
tion mixtures were mixed on a steel target with 0.5 /j,l of 2,5- 
dihydroxybenzoic acid (Aldrich) solution (20 mg/ml in 30% MeCN- 
0.5% trifluoroacetic acid [TFA]). 

In vivo sensitivity test. E. coli B wild-type, AyejB, or AsfcmA cells were 
grown in 10 ml of M63 broth supplemented with yeast extract at 37°C to 
an ODgQo of ~1. A 75Q-fjl volume of cell culture was added to 15 ml of 
melted top agar (0.65 g/liter of agar in M63 broth) cooled to ~50°C. The 
mixture was poured on the surface of an LB agar plate. After the agar 
solidified, 10-fx\ drops of solutions of 50 fxM McC'"'-^ (used as a positive 
control) or adenylation reaction mixtures (see above) were placed on a 
plate surface and allowed to dry. Plates were incubated for 4 to 6 h at 37°C, 
and growth inhibition zones around the sites where samples were applied 
were visually detected. 

Preparation of S30 cell extract. McC-sensitive E. coli BW25 11 3 and an 
McC-resistant \ABN derivative lacking the pepA, pepB, and pepN genes 
(5) were grown in 200 ml of LB medium at 37°C until the ODggo reached 
0.5. Cells were collected by centrifugation, washed with a buffer (20 mM 
Tris-HCl [pH 8.0], 10 mM MgCl2, 100 mM KCl), resuspended in 4 ml of 
the same buffer with the addition of 1 mM DTT, and disrupted by soni- 
cation. The lysate was centrifuged at 30,000 X g for 30 min at +4°C. The 
supernatants (total protein concentrations of 4 and 6 mg/ml for BW25 1 1 3 
and \ABN lysates, respectively) were divided into 200-jliI aliquots and 
stored at — 70°C until further use. 

tRNA aminoacylation reactions. S30 extracts (4 fjl) were mixed with 
2 fj.\ of reaction mixtures that had completed adenylation (see above) or 
0.1 mM McC^"'^ and incubated for 15 min to allow processing. At this 
point, 16 lA of reaction mix (30 mM Tris-HCl [pH 8.0], 30 mM KCl, 
8 mM MgClj, 1 mM DTT, 3 mM ATP), 5 mg/ml total tRNA (tRNA from 
E. coli MRE 600; Roche Diagnostics), and 40 jxM i*C-Asp (6) were added 
and reactions proceeded at room temperature for 5 min. The reaction 
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products were precipitated on filters with cold trichloroacetic acid (10% 
final concentration) and subjected to scintillation counting. 

MccA peptide processing. A 2.5-^1 volume of 5 mM MRTGNAN 
peptide and a l.S-fjl volume of 5 mM MDHIGFN peptide were mixed and 
combined with 5 jA E. coli BW25113 305 extract. The reaction mixtures 
were incubated for 0, 5, 15, and 60 min at room temperature and termi- 
nated by combining 1-(li1 reaction aliquots with 99 fxl of 0.5% trichloro- 
acetic acid. The reactions were next directly analyzed by mass spectrom- 
etry. 

Sequence analysis. Sequences from 2,262 completely sequenced bac- 
terial and archaeal genomes available in the Refseq database (26) (as of 
February 2013) were assigned to the ThiF/HesA/MoeB/El superfamily 
using the RPS-BLAST program (matching profiles for pfam00899 and 
COG0476 with an E value cutoff of 0.0 1 and with low-complexity filtering 
off) as implemented in a Conserved Domain Database search (27) The 
BLASTClust program (28) set up with a length coverage cutoff of 0.8 and 
a score coverage threshold (bit score divided by alignment length) of 0.8 
was used for clustering. One sequence was chosen for each cluster for 
further analysis. Genes encoding MccB and PaaA as well as aU those that 
were experimentally analyzed in this work were added to the sequence set. 
For all selected genes, five up- and downstream genes were collected for 
analysis of gene neighborhoods and assigned to profiles corresponding to 
those in the matching COG (29) and Pfam (30) databases using the RPS- 
BLAST program as indicated above. The Position-Specific Iterated (PSI)- 
BLAST program (31) with an inclusion E value cutoff of 0.0 1 was used to 
retrieve homologs of MccB and PaaA starting from several queries. For 
each query, the homologs with the best scores were collected until the first 
gene whose neighborhood suggested its involvement in biological pro- 
cesses unrelated to microcin production was identified. In addition, 10 
more sequences most similar to those representative of all the queries 
taken together were added into the final set of 60 sequences. Multiple 
alignment was built for this set using the MUSCLE program (32) followed 
by minor manual corrections, and one incomplete sequence was dis- 
carded. The length of the region (a potentially corresponding peptide 
clamp domain) upstream of the ThiF/HesA/MoeB/El domain was also 
measured. The confidently aligned blocks corresponding to the ThiF/ 
FlesA/MoeB/El domain with 136 informative positions were used for 
maximum-likelihood tree reconstruction using the FastTree program 
(33) with default parameters: ITT evolutionary model, discrete gamma 
model with 20 rate categories. The RAxML program (34) with the WAG 
substitution matrix and gamma-distributed evolutionary rates was used 
for the same alignment to build the phylogenetic tree to verify the out- 
group position. Both programs were employed to calculate the respective 
bootstrap values. Short peptides, the mccA candidates, were predicted if 
they were encoded immediately upstream of the respective MccB/PaaA- 
Irke genes and contained at least one asparagine residue. 
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